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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/785,455 



DATE: 02/25/97 
TIME: 1 1 :24:02 



INPUT SET: SI 5668. raw 



This Raw Listing contains the Generar* jy 
Information Section and up to the first 5 pa; 





SEQUENCE LISTING 



(1) 



General Information 





4 


— — > 


5 


— — > 


D 


— — > 


7 


— — ^ 


Q 

o 




Q 


_ — *> 


i n 




1 1 




1 9 

X ^ 




X <J 


_ _ *v 


XV 




X 3 




X o 


— — > 


1 7 

X r 


_ _ 


1 ft 
X o 


_ _ 


1 0 

X 7 




9f) 




9 1 








9 1 




24 




25 




26 




9 7 




28 




29 




30 




31 




32 




33 




34 




35 




36 




37 




38 




39 




40 




41 




42 




43 




44 




45 




46 



(i) APPLICANT 

(A) NAME: Hodgson, John 

(B) STREET: Great West Road 

(C) CITY: Brentford 

(D) STATE OR PROVINCE: Middlesex 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE: TW8 9EP 



(A) NAME: Hodgson, John 

(B) STREET: Great West Road 

(C) CITY: Brentford 

(D) STATE OR PROVINCE: Middlesex 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE: TW8 9EP 

(ii) TITLE OF THE INVENTION: Novel tRNA Synthetase 



(iii) NUMBER OF SEQUENCES : 2 

(iv) COMPUTER- READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(V) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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47 

48 ATGGCTAAAG AAACATTTTA TATAACAACC CCAATATACT ATCCTAGTGG GAATTTACAT 60 

4 9 ATAGGACATG CATATTCTAC AGTGGCTGGA GATGTTATTG CAAGATATAA GAGAATGCAA 120 

50 GGATATGATG TTCGTTATTT GACTGGAACG GATGAACACG GTCAAAAAAT TCAAGAAAAA 180 

51 GCTCAAAAAG CTGGTAAGAC AGAAATTGAA TATTTGGATG AGATGATTGC TGGAATTAAA 240 

52 CAATTGTGGG CTAAGCTTGA AATTTCAAAT GATGATTTTA TCAGAACAAC TGAAGAACGT 300 

53 CATAAACATG TCGTTGAGCA AGTGTTTGAA CGTTTATTAA AGCAAGGTGA TATCTATTTA 360 

54 GGTGAATATG AAGGTTGGTA TTCTGTTCCG GATGAAACAT ACTATACAGA GTCACAATTA 420 

55 GTAGACCCAC AATACGAAAA CGGTAAAATT ATTGGTGGCA AAAGTCCAGA TTCTGGACAC 4 80 

56 GAAGTTGAAC TAGTTAAAGA AGAAAGTTAT TTCTTTAATA TTAGTAAATA TACAGACCGT 540 

57 TTATTAGAGT TCTATGACCA AAATCCAGAT TTTATACAAC CACCATCAAG AAAAAATGAA 600 

58 ATGATTAACA ACTTCATTAA ACCAGGACTT GCTGATTTAG CTGTTTCTCG TACATCATTT 660 

5 9 AACTGGGGTG TCCATGTTCC GTCTAATCCA AAACATGTTG TTTATGTTTG GATTGATGCG 7 20 

60 TTAGTTAACT ATATTTCAGC ATTAGGCTAT TTATCAGATG ATGAGTCACT ATTTAACAAA 780 

61 TACTGGCCAG CAGATATTCA TTTAATGGCT AAGGAAATTG TGCGATTCCA CTCAATTATT 840 

62 TGGCCTATTT TATTGATGGC ATTAGACTTA CCGTTACCTA AAAAAGTCTT TGCACATGGT 900 

63 TGGATTTTGA TGAAAGATGG AAAAATGAGT AAATCTAAAG GTAATGTTGT AGACCCTAAT 960 

64 ATTTTAATTG ATCGCTATGG TTTAGATGCT ACACGTTATT ATCTAATGCG TGAATTACCA 1020 

65 TTTGGTTCAG ATGGCGTATT TACACCTGAA GCATTTGTTG AGCGTACAAA TTTCGATCTA 1080 

66 GCAAATGACT TAGGTAACTT AGTAAACCGT ACGATTTCTA TGGTTAATAA GTACTTTGAT 1140 

67 GGCGAATTAC CAGCGTATCA AGGTCCACTT CATGAATTAG ATGAAGAAAT GGAAGCTATG 1200 

68 GCTTTAGAAA CAGTGAAAAG CTACACTGAA AGCATGGAAA GTTTGCAATT TTCTGTGGCA 1260 

69 TTATCTACGG TATGGAAGTT TATAAGTAGA ACGAATAAGT ATATTGACGA AACAACGCCT 1320 

70 TGGGTATTAG CTAAGGACGA TAGCCAAAAA GATATGTTAG GCAATGTAAT GGCTCACTTA 1380 

71 GTTGAAAATA TTCGTTATGC AGCTGTATTA TTACGTCCAT TCTTAACACA TGCGCCGAAA 1440 

72 GAGATTTTTG AACAATTGAA CATAAACAAT CCTCAATTTA TGGAATTTAG TAGTTTAGAG 1500 
7 3 CAATATGGTG TGCTTACTGA GTCAATTATG GTTACTGGGC AACCTAAACC TATTTTCCCA 1560 

74 AGATTGGATA GCGAAGCGGA AATTGCATAT ATCAAAGAAT CAATGCAACC GCCTGCTACT 1620 

75 GAAGAGGAAA AAGAAGAGAT TCCTAGCAAA CCTCAAATTG ATATTAAAGA CTTTGATAAA 1680 

76 GTTGAAATTA AGGCAGCAAC GATTATTGAT GCTGAACATG TTAAGAAGTC AGATAAGCTT 1740 

77 TTAAAAATTC AAGTAGACTT AGATTCTGAA CAAAGACAAA TTGTATCAGG AATTGCCAAA 1800 

78 TTCTATACAC CAGATGATAT TATTGGTAAA AAAGTAGCAG TTGTTACTAA CCTGAAACCA 1860 
7 9 GCTAAATTAA TGGGACAAAA ATCTGAAGGT ATGATATTAT CTGCTGAAAA AGATGGTGTA 1920 
80 TTAACCTTAG TAAGTTTACC AAGTGCAATT CCAAATGGTG CAGTGATTAA ATAA 1974 
81 

82 (2) INFORMATION FOR SEQ ID NO: 2: 
83 

84 (i) SEQUENCE CHARACTERISTICS: 

85 (A) LENGTH: 657 amino acids 

86 (B) TYPE: amino acid 

87 (C) STRANDEDNESS: single 

88 (D) TOPOLOGY: linear 
89 

90 (ii) MOLECULE TYPE: peptide 
91 

92 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
93 

94 Met Ala Lys Glu Thr Phe Tyr lie Thr Thr Pro lie Tyr Tyr Pro Ser 

95 1 5 10 15 

96 Gly Asn Leu His lie Gly His Ala Tyr Ser Thr Val Ala Gly Asp Val 

97 20 25 30 

98 lie Ala Arg Tyr Lys Arg Met Gin Gly Tyr Asp Val Arg Tyr Leu Thr 

99 35 40 45 
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1 A A 
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Gly 


Tnr 


Asp 


GlU 
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Gin 


Lys 


lie 


Gin 


uxu 


Lys 


Aia 


Gin 


Lys 


Ala 


1 A 1 
101 




C A 

50 










55 










C A 










1 A O 

102 


Gly 


Lys 


Thr 


GlU 


lie 


GlU 


Tyr 


Leu 


TV — ~— 

Asp 


/"ill! 

GlU 


Met: 


lie 


Ala 


Gly 


lie 


Lys 


1U J 


0 3 










/ u 










/ 5 










A f\ 
oU 


1 A A 

104 


Gin 


Leu 


Trp 


Ala 


Lys 


Leu 


Glu 


t1 _ 

He 


Ser 


Asn 


Asp 


Asp 


Pne 


lie 


Arg 


Tnr 


1 A C 

105 










o c 
85 










O A 

90 










35 




1 A £ 

106 


Thr 


GlU 


Glu 


Arg 


HIS 


Lys 


HIS 


Val 


Val 


Glu 


Gin 


Val 


Phe 


Glu 


Arg 


Leu 


1 A ^ 
10 / 








1 A A 
1UU 










1 A C 

105 










11U 






1 A O 

108 


Leu 


Lys 


Gin 


Gly 


Asp 


He 


Tyr 


Leu 


Gly 


Glu 


Tyr 


Glu 


Gly 


Trp 


Tyr 


Ser 


1 A Q 

10 9 






115 










1 1 A 

120 










IOC 

125 








110 


Val 


Pro 


Asp 


Glu 


Thr 


Tyr 


Tyr 


Thr 


Glu 


Ser 


Gin 


Leu 


Val 


Asp 


Pro 


Gin 


111 




1 O A 
1 JO 










IOC 

135 










140 










112 


Tyr 


Glu 


Asn 


Gly 


Lys 


He 


He 


Gly 


Gly 


Lys 


Ser 


Pro 


Asp 


Ser 


Gly 


His 


113 


145 










150 










155 










160 


114 


Glu 


Val 


Glu 


Leu 


Val 


Lys 


Glu 


Glu 


Ser 


Tyr 


Pne 


Pne 


Asn 


He 


Ser 


Lys 


lib 










165 










1 T A 

170 










175 




116 


Tyr 


Thr 


Asp 


Arg 


Leu 


Leu 


Glu 


Phe 


Tyr 


Asp 


Gin 


Asn 


Pro 


Asp 


Til— — 

Phe 


He 


117 








1 O A 

180 










185 










1 Q A 

190 






118 


Gin 


Pro 


Pro 


Ser 


Arg 


Lys 


Asn 


Glu 


Met 


He 


Asn 


Asn 


Tit- — . 

Phe 


He 


Lys 


Pro 


119 






195 










200 










205 








120 


Gly 


Leu 


Ala 


Asp 


Leu 


Ala 


Val 


Ser 


Arg 


Thr 


Ser 


Phe 


Asn 


Trp 


Gly 


t 9 — T 

Val 


121 




210 










215 










220 










122 


His 


Val 


Pro 


Ser 


Asn 


Pro 


Lys 


His 


Val 


Val 


Tyr 


Val 


Trp 


He 


Asp 


Ala 


TOO 

12 J 


o o c 
225 










O O A 

230 










O 1 c 

235 










O Jt A 

240 


124 


Leu 


Val 


Asn 


Tyr 


He 


Ser 


Ala 


Leu 


Gly 


Tyr 


Leu 


Ser 


Asp 


Asp 


Glu 


Ser 


IOC 

125 










O A C 










OCA 

250 










25 5 




126 


Leu 


Phe 


Asn 


Lys 


Tyr 


Trp 


Pro 


Ala 


Asp 


He 


His 


Leu 


Met 


Ala 


Lys 


Glu 


127 








260 










265 










270 






128 


lie 


Val 


Arg 


Phe 


His 


Ser 


He 


He 


Trp 


Pro 


He 


Leu 


Leu 


Met 


Ala 


Leu 


129 






275 










280 










285 








130 


Asp 


Leu 


Pro 


Leu 


Pro 


Lys 


Lys 


Val 


Phe 


Ala 


His 


Gly 


Trp 


He 


Leu 


Met 


131 




290 










295 










300 










132 


Lys 


Asp 


Gly 


Lys 


Met 


Ser 


Lys 


Ser 


Lys 


Gly 


Asn 


Val 


Val 


Asp 


Pro 


Asn 


133 


305 










310 










315 










320 


134 


lie 


Leu 


lie 


Asp 


Arg 


Tyr 


Gly 


Leu 


Asp 


Ala 


Thr 


Arg 


Tyr 


Tyr 


Leu 


Met 


IOC 

135 










325 










330 










335 




136 


Arg 


Glu 


Leu 


Pro 


Phe 


Gly 


Ser 


Asp 


Gly 


Val 


Phe 


Thr 


Pro 


Glu 


Ala 


Til— — 

Phe 


137 








O A A 

340 










"5 A C 

345 










OCA 

350 






138 


val 


Glu 


Arg 


Thr 


Asn 


Phe 


Asp 


Leu 


TV T _ 

Ala 


Asn 


Asp 


Leu 


Gly 


Asn 


Leu 


Val 


1 Q Q 

i j y 






J5 5 










OCA 

J60 










Job 








1 1 A 

140 


Asn 


Arg 


Thr 


He 


Ser 


Met 


Val 


Asn 


Lys 


Tyr 


Phe 


Asp 


Gly 


Glu 


Leu 


Pro 


i i i 
141 




OTA 

370 










375 










o a a 

380 










142 


Ala 


Tyr 


Gin 


Gly 


Pro 


Leu 


His 


Glu 


Leu 


Asp 


Glu 


Glu 


Met 


Glu 


Ala 


Met 




O Q C 
J O J 
































144 


Ala 


Leu 


Glu 


Thr 


Val 


Lys 


Ser 


Tyr 


Thr 


Glu 


Ser 


Met 


Glu 


Ser 


Leu 


Gin 


145 










405 










410 










415 




146 


Phe 


Ser 


Val 


Ala 


Leu 


Ser 


Thr 


Val 


Trp 


Lys 


Phe 


He 


Ser 


Arg 


Thr 


Asn 


147 








420 










425 










430 






148 


Lys 


Tyr 


He 


Asp 


Glu 


Thr 


Thr 


Pro 


Trp 


Val 


Leu 


Ala 


Lys 


Asp 


Asp 


Ser 


149 






435 










440 










445 








150 


Gin 


Lys 


Asp 


Met 


Leu 


Gly 


Asn 


Val 


Met 


Ala 


His 


Leu 


Val 


Glu 


Asn 


He 


151 




450 










455 










460 










152 


Arg 


Tyr 


Ala 


Ala 


Val 


Leu 


Leu 


Arg 


Pro 


Phe 


Leu 


Thr 


His 


Ala 


Pro 


Lys 
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ICQ 
1 O J 


465 










Q 1 V 






IRA 


Glu 


He 


nU a 


r*l 1 1 

ulU 


Pi n 

uin 


Leu 


Asn 


lie 


1 EC 

10 0 


















ICC 

lob 


Ser 


Ser 


Leu 


ulU 




Tyr 


oiy 


Val 


1 


















1 CO 
1 0 O 


Gly Gin 


Pro 


Lys 


Pro 


lie 


Jrne 


Pro 


ICQ 






CI c 

J J. J 










R 9 fl 
O ^ v 


1 C A 

1 oU 


Ala 


Tyr 


Tl A 

lie 


Lys 


olU 


O A "T* 

be! 


Me x 


r*l n 

bin 


1 fx 1 
lOl 




530 










c oc 

J J J 




lb/ 


Glu 


Glu 


Tl A 

lie 


Pro 


ber 


Lys 


Pro 


oin 


1 £1 
lO J 


545 










ccn 

3 3 v 






1 C A 

1 b 4 


Val 


Glu 


Tl A 

lie 


Lys 


Aia 


Ala 

Aid 


rpVi -r* 

i nr 


Tl q 

lie 


ICC 

loo 










ODD 








loo 


Ser 


Asp 


Lys 


Leu 


Leu 


Lys 


Tl A 

lie 


oin 


1 67 

ID / 








con 










ICQ 
lOO 


Gin 


He 


vai 


bet 


ij±y 


Tl A 

lie 


A 1 o 

Aia 


Lys 


169 






595 










600 


170 


Gly Lys 


Lys 


Val 


Ala 


Val 


Val 


Thr 


171 




610 










615 




172 


Gly Gin 


Lys 


Ser 


Glu 


Gly 


Met 


He 


173 


625 










630 






174 


Leu 


Thr 


Leu 


Val 


Ser 


Leu 


Pro 


Ser 


175 










645 








176 


Lys 
















177 


















178 
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475 










480 


Asn 


Asn 


Dm 


m n 


trim 










490, 










495 




Leu 


l Hi 


ulU 


Ser 


Tl <a 

J. 16 


Mc t 


val 




J U J 
















Arcj 


Leu 


Asp 


del 


nl it 

UlU 


Al A 
.nia 


01.L1 


lie 










525 








Pro 


Pro 


a 1 a 
Ala 


Tnr 


olU 


ulU 


nl 1 1 

ulU 


Lys 








RAO 
*j *± u 










Tl a 

lie 


Asp 


Tl a, 

lie 


Lys 


Asp 


XT 11*2 


Asp 


Lys 






J J 3 












Tl A 

lie 


Asp 


Aia 


blU 


TT , — 

rllS 


Val 


Lys 


Lys 




C T A 
0 1 U 










0/0 




vai 


Asp 


Leu 


Asp 


C* A V 

ber 


pin 


r*l *\ 

bin 


Arcf 


585 










590 






Phe 


Tyr 


Thr 


Pro 


Asp 


Asp 


He 


He 










605 








Asn 


Leu 


Lys 


Pro 


Ala 


Lys 


Leu 


Met 








620 










Leu 


Ser 


Ala 


Glu 


Lys 


Asp 


Gly 


Val 






635 










640 


Ala 


He 


Pro 


Asn 


Gly 


Ala 


Val 


He 




650 










655 





